On the Rule-Based Parsing of Czech
نویسنده
چکیده
This paper presents various attempts to accustom the Rule-based Approach (RBA) as originally introduced by Eric Brill in 1993 (Brill 1993a), on the problem of parsing of Czech (a highly inflective language) within a dependency based syntactic framework (Sgall et al 1986). It is experimentally supported in this paper that the modification of RBA for the stated aim is neither simple nor straightforward for all attempts fail to produce such a set of rules that parses Czech with a comparable success rate to the one obtained for English or the one obtained for the currently best parser (statistical) for Czech (Collins et al. 1999).
منابع مشابه
Rule-based Tagging: Morphological Tagset versus Tagset of Analytical Functions
This work presents a part of a more global study on the problem of parsing of Czech and on the knowledge extraction capabilities of the Rule-based method. It is shown that the successfulness of the Rule-based method for English and its unsuccessfulness for Czech, is not only due to the small cardinality of the English tagset (as it is usually claimed) but mainly depends on its structure (”regul...
متن کاملCzech-English Dependency-based Machine Translation
We present some preliminary results of a Czech-English translation system based on dependency trees. The fully automated process includes: morphological tagging, analytical and tectogrammatical parsing of Czech, tectogrammatical transfer based on lexical substitution using word-to-word translation dictionaries enhanced by the information from the English-Czech parallel corpus of WSJ, and a simp...
متن کاملCzech-English Dependency Tree-based Machine Translation
We present some preliminary results of a Czech-English translation system based on dependency trees. The fully automated process includes: morphological tagging, analytical and tectogrammatical parsing of Czech, tectogrammatical transfer based on lexical substitution using word-to-word translation dictionaries enhanced by the information from the English-Czech parallel corpus of WSJ, and a simp...
متن کاملShallow Parsing of Czech Sentence Based on Correct Morphological Disambiguation
The basis of such an approach is provided by a very complex and sophisticated rule-based morphological disambiguation which can disambiguate Czech sentence with a very high reliability, i.e. with a minimum number of errors. This is, of course, very important for any language and all the more so for Czech whose ambiguity rate is generally extremely high (as compared e.g. to other Slavic language...
متن کاملRule-based analytical parsing of Czech
This paper introduces two methods of dependency parsing by means of rules automatically inferred from training data. For Czech the accuracy of the proposed parsers is considerably behind that of the state-of-the-art parsers, but the paper brings a new method of inference of rules and applies transformation-based learning to parsing in a novel way. Moreover, the parsers use properties of Czech l...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Prague Bull. Math. Linguistics
دوره 77 شماره
صفحات -
تاریخ انتشار 2002